Rational Decisions, Random Matrices and Spin Glasses 



OO 
c3 



Stcfano Galluccio^, Jean-Philippe Bouchaudt'* and Marc Potters^ 
1 Science & Finance, 109-111 rue Victor Hugo, 92523 Levallois Cedex, FRANCE 
* Service de Physique de I'Etat Condense, Centre d' etudes de Saclay, 
Orme des Merisiers, 91191 Gif-sur-Yvette Cedex, FRANCE 
(February 1, 2008) 

We consider the problem of rational decision making in the presence of nonlinear constraints. 
By using tools borrowed from spin glass and random matrix theory, we focus on the portfolio 
optimisation problem. We show that the number of "optimal" solutions is generically exponentially 
large: rationality is thus de facto of limited use. In addition, this problem is related to spin glasses 
with Levy-like (long-ranged) couplings, for which we show that the ground state is not exponentially 
degenerate. 
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It is often hoped that science can help in choosing the 
right decision. A rational decision is usually thought of 
as the solution which optimizes a certain utility (or cost) 
function, which is supposed to quantify the performance 
of a given strategy. A simple example is that of portfolio 
optimization: given a set of financial assets, character- 
ized by their average return and their risk, what is the 
optimal weight of each asset, such that the overall port- 
folio provides the best return for a fixed level of risk, or 
conversely, the smallest risk for a given overall return? It 
is frequently the case that the optimal solution is unique, 
so that 'rational' operators should choose this particular 
one, unless they arc inefficient. In the case of portfolio 
selection, the only free parameter is the proportion of 
the "riskless asset"; all portfolios should thus look very 
much alike. This is at the heart of the so called Capital 
Asset Pricing Model, (CAPM), one of the cornerstones 
of modern theoretical finance 13] . 

In this Letter we will show that, surprisingly, there are 
cases where the number of "rational" decisions is expo- 
nentially large (for example in the number of assets) - 
much like the number of ground states in spin glasses or 
other disordered systems M. This means that there is 
an irreducible component of randomness in the final de- 
cision: the degeneracy between this very large number 
of possibilities can only be lifted by small, 'irrational', 
effects. 

More precisely, the average return Rp of a portfolio 
P of N assets, is defined as Rp = ^2^ = iPiRi, where pi 
(i = 1,---,N) is the amount of total capital invested 
in the asset i, and {Ri} are the expected returns of the 
individual assets. Similarly, the risk on a portfolio can 



be associated to the total variance op = Yli=iPi a i 



Si^y PiPj°~ij > where of are the single asset variances and 
o~ij are the inter-asset covariances. Alternatively, one can 
define op = j=iPi^ijPj where Cy is the covariance 
matrix. 

Generally speaking, the goal of each rational investor 
consists in maximizing the portfolio return and/or in 
minimizing its variance (or risk). This optimization pro- 



cedure, or portfolio selection, as investigated in its sim- 
plest version by Markowitz jj) , results in a set of all pos- 
sible "optimal" couples {Rp, op} which all lie on a curve 
called the efficient frontier (ef). For a given degree of 
risk, there is thus a unique selection of assets {p*}f=i 
which maximizes the expected return. On the opposite, 
we will show that in some cases this simple scenario does 
not hold: a very large number of compositions {pi}^Li 
can be "optimal" , in the sense defined above. Further- 
more, these optimal compositions can be very different 
from another. 

In order to illustrate this rather general scenario with 
an explicit example, we shall investigate the problem of 
portfolio selection in the case one can buy, but also to 
short sell stocks, currencies, commodities and other kinds 
of financial assets. This is the case of 'futures' markets 
or margin accounts. The only requirement is to leave a 
certain deposit (margin) proportional to the value of the 
underlying asset 0] . This means that the overall position 
on these markets is limited by a constraint of the form: 
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where Wi is the price of asset i, W the total wealth of 
the operator, and 7 the fraction defining the margin re- 
quirement. Here, pi is the number of contracts on asset 
i which are bought (pi > 0) or sold (pt < 0). It is worth 
stressing that it is the nonlinear form of the above con- 
straint that makes the problem interesting, as we shall 
see below. At this point we simply note that the optimal 
portfolio p* , which - say - minimizes the risk for a given 
return, can be obtained using two Lagrange multipliers 
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Without loss of generality, one can always set Wi = 1. 
Defining Si to be the sign of p*, one gets: 
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(3) 



others. The matrix Ca can thus be written as C 



M 



where C _1 is the matrix inverse of C. Taking the sign of 
this last equation leads to an equation for the Si's which 
is identical to those defining the locally stable configura- 
tions in a spin-glass H : 



Si = sign 



H, 



N 



(4) 



where Hi = i^Ylj C^Rj and = pC^ 1 are the ana- 
logue of the "local field" and of the spin interaction ma- 
trix, respectively. Once the Si's are known, the p* are 
determined by Eq. (||); v and p are fixed, as usual, so 
as to satisfy the constraints. Let us consider, for sim- 
plicity, the case where one is interested in the minimum 
risk portfolio, which corresponds to v — 0. [By analogy 
with spin glasses, the following results are not expected 
to change qualitatively if v ^ ||]. In this case, one sees 
that only \p\ is fixed by the constraint. Furthermore, the 
minimum risk is given by 7Z = p 2 Y^jk SjCjkSk, which 
is the analogue of the energy of the spin glass. It is now 
well known that if J is a random matrix, the so called 
"tap" equation (^), defining the metastable states of a 
spin glass at zero temperature, generically has an expo- 
nentially large (in N) number of solutions Note that, 
as stated above, the multiplicity of solutions is a direct 
consequence of the nonlinear constraint (1). 

In what respect can the correlation matrix C be con- 
sidered as random? Here we take a step in complete 
analogy with the original Wigner and Dyson's idea of re- 
placing the Hamiltonian of a deterministic but complex 
system by a random matrix . More precisely, they pro- 
posed to study the properties of one generic member of 
a statistical ensemble M which shares the symmetries of 
the original Hamiltonian. Similarly, in the present sit- 
uation, we would like to see C _1 as a random matrix 
whose elements are distributed according to a given en- 
semble, compatible with some general properties. In our 
case, for instance, we must select C _1 from an ensem- 
ble of positive definite matrices, as all the eigenvalues of 
the covariance matrix are > 0. The choice of a suitable 
statistical ensemble is guided by the following observa- 
tion. The (daily) fluctuations of the asset i, SWi, can be 
decomposed as: 
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,5E a + SW i0 , 



(5) 



where the E a are K independent factors which affects 
the assets differently, and SWio is the part of the fluctu- 
ation which is specific to asset i (and thus independent 
of the E a ). The E a contain all information on how the 
stochastic evolution of a given asset is correlated to the 



M T + D, where D is a positive diagonal matrix, and M 
a N x K rectangular matrix. The above representation 
ensures that all eigenvalues of C are positive. In order 
to simplify the problem, we shall make in the following 
the assumption that D is proportional to the identity ma- 
trix and that the coefficients Mi a are completely random. 
For K — N, C is a member of the so-called Exponential 
Orthogonal Ensemble (eoe) Q, which is a maximum en- 
tropy (least information) ensemble. For any N x K ma- 
trix M, (with N > K), the density of eigenvalues pc(X) 
of C is exactly known in the limit N — > oo, K — > oo and 
Q = K/N fixed ||. In the limit Q = 1 (which we con- 
sider from now on) the normalized eigenvalue density of 
the (square) matrix M is the well known semi-circle law, 
from which the density of eigenvalues of C and J=/xC~ 1 
are easily deduced. In particular, for the situation we are 
interested in, one finds 
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X e]ar, (4 + a)r]. (6) 



where a measures the relative amplitude of the diago- 
nal contribution D and r the width of the distribution. 
The distribution of eigenvalues of J=/iC -1 , pj(x), can 
be easily calculated from Eq.(6). Note that in the case 
where a = 0, pj(x) has a power-law tail decaying as 
pj(x) tx a;~ 3 / 2 . For finite a, however, the maximum 
eigenvalue of J is oc I /a. 




FIG. 1. Smoothed distribution of eigenvalues of C, where 
the correlation matrix C is extracted from N = 406 assets of 
the S&P500 during the years 1991-1996. For comparison we 
have plotted the best fit obtained with the theoretical density 
(6) with a ~ .34, r ~ 1.8 x 10 -4 . Results are qualitatively 
similar in the case of the London or Zurich Stock Exchanges. 

We have studied numerically the density of eigenval- 
ues of the correlation matrix of 406 assets of the New- 
York Stock Exchange (nyse), based on daily variations 
during the years 1991-96, for a total of 1500 days. More- 
over, by repeating our analysis on different Stock Markets 
(e.g. London and Zurich) we have checked the robustness 
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and universality of our results. The observed density of 
eigenvalues is shown in Fig.l together with the best fit 
obtained by assuming Q = 1. The fitting parameters are 
a and the width of the semi-circle distribution r, which 
are found to be, respectively, a ~ .34 and r ~ 1.8 x 10~ 4 . 
The deviation from the theoretical curve is partially due 
to finite N effects which are expected to smear out the 
singularities at A = ar and (4 + a)r. It might also be 
that D has unequal diagonal elements (this would also 
smear out the singularities of pj(X)), or let the number 
K of "explicative factors" be less than TV (a rather likely 
situation). Note that the empirical pc can be rather 
well fitted by a log-normal distribution, suggesting that 
the relevant ensemble has the additional symmetry C 
C _1 , although we have not been able to justify this prop- 
erty. 

The point, however, is not to claim that the density 
of eigenvalues is precisely described by the above model, 
but that it represents a reasonable approximation for our 
purposes. Once the density of eigenstates pj(X) is known, 
the number of solutions of the optimization equation, av- 
eraged over the matrix ensemble, can be computed using 
tools borrowed from spin glasses and random matrix the- 
ory. As discussed in [ fTo|] , the main ingredient is the gen- 
erating function G(x), defined as 
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E>[J]exp{ Tr (JA/2)} ^ exp {N TrG (A/A 7 )} (7) 



where A is any symmetric N x N matrix of finite 
rank, and T>[ 3] = d[J)P([J]) is the probability mea- 
sure over the matrix ensemble. The above formula holds 
for general complex hermitian matrices J For ex- 

ample, in the simplest possible case, where J is ex- 
tracted from a Gaussian Ensemble, one has P([J]) = 
exp{- Tr(J 2 /4cr 2 )}/Z and thus G(x) = x 2 /A. Formally, 
G(x) can be computed in general from pj(X) using a se- 
ries of rather involved transformations Jl2| , which can 
be somewhat simplified by the use of a Hilbert integral 
transform of the eigenvalue density jl3| . 

In the specific case (^), we have been able to compute 
G a (x) exactly for a = 0, with the result Gq(x) = —^—x, 
and perturbatively in the limits a«l and a ^S> 1. Leav- 
ing aside all mathematical details |13] , the results are: 



G Q (2:)~— , a>l, 

G a {x) ~ — \fa~g ( — J , a<l, 



(8) 



where the scaling function g(u) behaves as g(u) ~ \fu for 
«>1 and as g(u) oc u for u -C 1. As a side remark, one 
can show that for Levy random matrices EJ where p(X) 
decays as X~ 1 ~^ for large A, the characteristic function 
behaves as G(x) ~ — (—x) 13 for small x's, and for all 
P < 1. Consequently, the problem we are concerned with 
here (/3 = 1/2) can be seen as the study of the metastable 
states in spin glasses with long-range interactions fl5|,|l3| . 



Once G a (x) is known, one can write the average num- 
ber of solutions Af of the tap equations for large N as 
Af ~ exp{A/(a)}, where /(a) is obtained from a steepest 
descent approximation. Indeed, from Eq.(4), one derives 
the following result, valid for large N [n0|: 
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where 9(u) is the usual Heavyside function of its argu- 
ment. From (8), we find that /(a) very rapidly converges 
to In 2 (the maximum allowed number of solutions) when 
a is large: 



f(a) =ln2-e 



-a 2 /2 
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(12) 



a result which we have numerically confirmed by extract- 
ing J from the proper statistical ensemble. To give an 
idea, one finds f(a = 1) ~ .686 ± .003, which is already 
quite close to In 2 = .693147 . . .. 

For small a, calculations become rather involved, as the 
extremizing set {x* , y* , W* , Z*} corresponds to a min- 
imum and actually lies on the domain's border. The 
final result is that y* = x* 2 , W* = l/(8\/2x* 3 ) and 
Z* = l/{AV2x*), which leads to /(a) cx a 1 / 4 for a < 1. 
We have confirmed this result numerically (see Fig 2). 
Therefore, for a = 0, f(a = 0) = 0, i.e. the number of 
metastable solutions does not grow exponentially with N. 
This result means that the presence of very large eigen- 
values prevents the existence of many metastable states. 
This has an interesting consequence in the physical con- 
text, which was conjectured in |15|: a spin glass with 
a broad distribution of couplings cannot sustain many 
ground states; one can thus expect that the low temper- 
ature phase of these models is rather different from the 
'standard picture' 0. We can show that this is true for 
all < 1 |§; we find in particular that f(a) ~ if(/3)a /3/2 
for small a, with a prefactor K({3) algebraically diverging 
for $ -> 1. For p > 1, finally, f(a = 0) > 0. 

Let us now come back to our main theme. In what 
respect is the above picture a valid description of the 
portfolio optimization problem? In the real situation, of 
course, C is given by the "historical" correlation matrix. 
If our model is correct, then one should expect the num- 
ber of optimal solutions to be exponentially large, even 
for small a's. By extracting true correlation matrices 
of sizes up to N = 20 from the available 406 assets of 
the NYSE (see above), we have indeed numerically found 
that A/ny ~ exp(/NyA) with /ny = .68±.01. This result 
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does not qualitatively change in the case of the London or 
Zurich markets. Despite of the rather low dimensional- 
ity of the involved matrices, with unavoidable finite-size 
effects, the above result is quite well reproduced by our 
theoretical model (which is valid at N ^> 1). In fact, 
by using the fitting (a, r) parameters as in Fig.l, we find 
a theoretical value /th = -63 ± 0.02, not too far from 
the real one. Similarly by extracting C from 172 stocks 
of the London Stock Exchange (LSE) we have found an 
empirical /lse = -43 ± .09 and a corresponding theoret- 
ical value /th = -32 ± .07. Remarkably, our admittedly 
approximate description of the density of states of the 
matrix J seems to account satisfactorily for the observed 
number of "optimal portfolios" . 



■ Numerical simulalii 
Best fit: f(a>=0.84 a 




FIG. 2. The theoretical exponent f(a) plotted vs a (a < 1) 
when C is extracted from the EOE, using an exact enumera- 
tion procedure to find the numbers of solutions up to N — 27. 
Finite size effects are clearly present for the smallest values of 
a. The theoretical line Ka 1 ^ is plotted for comparison, with 
K ~ .84. 

By means of a combination of analytical and numerical 
arguments, we have thus shown that the number of op- 
timal portfolios in futures markets (where the constraint 
on the weights is non linear) grows exponentially with 
the number of assets. On the example of U.S. stocks, 
we find that an optimal portfolio with 100 assets can 
be composed in ~ 10 29 different ways ! Of course, the 
above calculation counts all local optima, disregarding 
the associated value of the residual risk 1Z. One could 
extend the above calculations to obtain the number of 
solutions for a given 1Z. Again, in analogy with [fl0|| , 
we expect this number to grow as exp Nf(lZ, a), where 
f(lZ, a) has a certain parabolic shape, which goes to zero 
for a certain 'minimal' risk 7Z* . But for any small interval 
around 1Z* , there will already be an exponentially large 
number of solutions. The most interesting feature, with 
particular reference to applications in economy, finance 
or social sciences, is that all these quasi-degenerate solu- 
tions can be very far from each other: in other words, it 
can be rational to follow two totally different strategies! 



Furthermore, these solutions are usually 'chaotic' in the 
sense that a small change of the matrix J, or the addi- 
tion of an extra asset, completely shuffles the order of the 
solutions (in terms of their risk). Some solutions might 
even disappear, while other appear. It is worth pointing 
out that the above scenario is not restricted to portfolio 
theory, but is germane to a variety of situations. For ex- 
ample, it is well known in Game Theory that each player 
has a different utility function he must maximize in or- 
der to get the best profit. When, in addition, nonlinear 
contraints are present, we expect a similar proliferation 
of solutions. As emphasized above, the existence of an 
exponentially large number of solutions forces one to re- 
think the very concept of rational decision making. 
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